AITopics | spurious local minima

b139e104214a08ae3f2ebcce149cdf6e-Paper.pdf

Neural Information Processing SystemsApr-21-2026, 20:09:21 GMT

artificial intelligence, local minima, machine learning, (16 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.72)

Add feedback

How Much Restricted Isometry is Needed In Nonconvex Matrix Recovery?

Neural Information Processing SystemsMar-17-2026, 02:38:06 GMT

When the linear measurements of an instance of low-rank matrix recovery satisfy a restricted isometry property (RIP) --- i.e. they are approximately norm-preserving --- the problem is known to contain no spurious local minima, so exact recovery is guaranteed. In this paper, we show that moderate RIP is not enough to eliminate spurious local minima, so existing results can only hold for near-perfect RIP. In fact, counterexamples are ubiquitous: every $x$ is the spurious local minimum of a rank-1 instance of matrix recovery that satisfies RIP. One specific counterexample has RIP constant $\delta=1/2$, but causes randomly initialized stochastic gradient descent (SGD) to fail 12\% of the time. SGD is frequently able to avoid and escape spurious local minima, but this empirical result shows that it can occasionally be defeated by their existence. Hence, while exact recovery guarantees will likely require a proof of no spurious local minima, arguments based solely on norm preservation will only be applicable to a narrow set of nearly-isotropic instances.

artificial intelligence, machine learning, proceedings, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.60)

Add feedback

26e359e83860db1d11b6acca57d8ea88-Paper.pdf

Neural Information Processing SystemsFeb-19-2026, 15:24:57 GMT

Some recent results do consider residual-like elements (see discussion of related work below),butgenerallydonotapply tostandard architectures.

artificial intelligence, arxivpreprintarxiv, machine learning, (16 more...)

Neural Information Processing Systems

Country:

North America > Canada > Quebec > Montreal (0.04)
Asia > Middle East > Israel (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

Add feedback

How Much Restricted Isometry is Needed In Nonconvex Matrix Recovery?

Richard Zhang, Cedric Josz, Somayeh Sojoudi, Javad Lavaei

Neural Information Processing SystemsFeb-15-2026, 08:09:25 GMT

Neural Information Processing Systems http://nips.cc/

local minima, local minimum, spurious local minima, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Alameda County > Berkeley (0.05)
Asia > Middle East > Jordan (0.04)
North America > Canada > Quebec > Montreal (0.04)
Asia > China (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)

Add feedback

A theory on the absence of spurious solutions for nonconvex and nonsmooth optimization

Cedric Josz, Yi Ouyang, Richard Zhang, Javad Lavaei, Somayeh Sojoudi

Neural Information Processing SystemsFeb-14-2026, 05:48:27 GMT

We study the set of continuous functions that admit no spurious local optima (i.e.

artificial intelligence, local minima, machine learning, (20 more...)

Neural Information Processing Systems

Country: North America > Canada > Quebec > Montreal (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)

Add feedback

ca92ff06d973ece92cecc561757d500e-Paper-Conference.pdf

Neural Information Processing SystemsFeb-11-2026, 22:12:46 GMT

critical point, matrix, mc-bm, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Illinois > Cook County > Evanston (0.04)
North America > United States > New York > New York County > New York City (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Afghanistan > Parwan Province > Charikar (0.04)

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.71)

Add feedback

HowManySamplesisaGoodInitialPointWorthin Low-rankMatrixRecovery?

Neural Information Processing SystemsFeb-9-2026, 09:36:21 GMT

As a consequence, these global guarantees tend to be pessimistic, because the number of samples must be sufficiently large to eliminate spurious local minima everywhere, even at adversarial locations.

artificial intelligence, foc, soc, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Illinois (0.05)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology: Information Technology > Artificial Intelligence (0.94)

Add feedback

94c4dd41f9dddce696557d3717d98d82-Paper.pdf

Neural Information Processing SystemsFeb-9-2026, 09:36:15 GMT

foc, sample complexity, spurious local minima, (11 more...)

Neural Information Processing Systems

Country:

North America > United States > Illinois > Champaign County > Urbana (0.04)
North America > United States > Illinois > Champaign County > Champaign (0.04)
North America > Canada (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.70)

Add feedback

How many samples is a good initial point worth in Low-rank Matrix Recovery?

Neural Information Processing SystemsDec-24-2025, 07:33:18 GMT

Given a sufficiently large amount of labeled data, the nonconvex low-rank matrix recovery problem contains no spurious local minima, so a local optimization algorithm is guaranteed to converge to a global minimum starting from any initial guess. However, the actual amount of data needed by this theoretical guarantee is very pessimistic, as it must prevent spurious local minima from existing anywhere, including at adversarial locations. In contrast, prior work based on good initial guesses have more realistic data requirements, because they allow spurious local minima to exist outside of a neighborhood of the solution. In this paper, we quantify the relationship between the quality of the initial guess and the corresponding reduction in data requirements. Using the restricted isometry constant as a surrogate for sample complexity, we compute a sharp "threshold" number of samples needed to prevent each specific point on the optimization landscape from becoming a spurious local minima. Optimizing the threshold over regions of the landscape, we see that, for initial points not too close to the ground truth, a linear improvement in the quality of the initial guess amounts to a constant factor improvement in the sample complexity.

good initial point worth, low-rank matrix recovery, spurious local minima, (8 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.60)
Information Technology > Artificial Intelligence > Machine Learning (0.40)

Add feedback

How Much Restricted Isometry is Needed In Nonconvex Matrix Recovery?

Neural Information Processing SystemsNov-20-2025, 23:16:49 GMT

When the linear measurements of an instance of low-rank matrix recovery satisfy a restricted isometry property (RIP) --- i.e. they are approximately norm-preserving --- the problem is known to contain no spurious local minima, so exact recovery is guaranteed. In this paper, we show that moderate RIP is not enough to eliminate spurious local minima, so existing results can only hold for near-perfect RIP. In fact, counterexamples are ubiquitous: every $x$ is the spurious local minimum of a rank-1 instance of matrix recovery that satisfies RIP. One specific counterexample has RIP constant $\delta=1/2$, but causes randomly initialized stochastic gradient descent (SGD) to fail 12\% of the time. SGD is frequently able to avoid and escape spurious local minima, but this empirical result shows that it can occasionally be defeated by their existence. Hence, while exact recovery guarantees will likely require a proof of no spurious local minima, arguments based solely on norm preservation will only be applicable to a narrow set of nearly-isotropic instances.

nonconvex matrix recovery, restricted isometry, spurious local minima, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.60)

Add feedback

Filters

Collaborating Authors

spurious local minima

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

b139e104214a08ae3f2ebcce149cdf6e-Paper.pdf

How Much Restricted Isometry is Needed In Nonconvex Matrix Recovery?

26e359e83860db1d11b6acca57d8ea88-Paper.pdf

How Much Restricted Isometry is Needed In Nonconvex Matrix Recovery?

A theory on the absence of spurious solutions for nonconvex and nonsmooth optimization

ca92ff06d973ece92cecc561757d500e-Paper-Conference.pdf

HowManySamplesisaGoodInitialPointWorthin Low-rankMatrixRecovery?

94c4dd41f9dddce696557d3717d98d82-Paper.pdf

How many samples is a good initial point worth in Low-rank Matrix Recovery?

How Much Restricted Isometry is Needed In Nonconvex Matrix Recovery?